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ABSTRACT 

MicroRNAs (miRNAs) are small non-coding RNAs 
that regulate gene expression through RNA interfer- 
ence. Human miRNAs are generated through a 
series of enzymatic processing steps. The precursor 
miRNA (pre-miRNA) is recognized and cleaved by a 
complex containing Dicer and several non-catalytic 
accessory proteins. HIV TAR element binding 
protein (TRBP) is a constituent of the Dicer 
complex, which augments complex stability and po- 
tentially functions in substrate recognition and 
product transfer to the RNA-induced silencing 
complex. Here we have analysed the interaction 
between the RNA-binding region of TRBP and an 
oncogenic human miRNA, miR-1 55, at different 
stages in the biogenesis pathway. We show that 
the region of TRBP that binds immature miRNAs 
comprises two independent double-stranded RNA- 
binding domains connected by a 60-residue flexible 
linker. No evidence of contact between the two 
double-stranded RNA-binding domains was 
observed either in the apo- or RNA-bound state. 
We establish that the RNA-binding region of TRBP 
interacts with both pre-miR-1 55 and the miR-1 55/ 
miR-155* duplex through the same binding 
surfaces and with similar affinities, and that two 
protein molecules can simultaneously interact with 
each immature miRNA. These data suggest that 
TRBP could play a role before and after processing 
of pre-miRNAs by Dicer. 



INTRODUCTION 

The majority of human micro-RNAs (miRNAs) are 
produced from long RNA transcripts in a two-step 
process (1,2). The initial primary miRNA (pri-miRNA) 
transcript can be 1-2 kilobases (kb) in length. Within the 
pri-miRNA, a short, imperfect stem-loop structure is 
recognized and excised by the nuclear 'Microprocessor' 
complex. The product is a 60-70-nucleotide (nt) precursor 
miRNA (pre-miRNA), which is exported from the nucleus 
by an Exportin-5/Ran-GTP complex. In the cytoplasm, 
the pre-miRNA is further processed by the Dicer 
complex into a 20-25-base-pair (bp) duplex, which 
contains the final miRNA and its reverse complement, 
termed miRNA*. The imperfect miRNA/miRNA* 
duplex is then transferred to the RNA-induced silencing 
complex (RISC), which is the mediator of miRNA- 
directed RNA interference in humans. 

The multidomain human TAR element binding protein 
(TRBP) is a component of the Dicer complex and has 
been implicated in the processing of pre-miRNAs (3,4). 
TRBP was initially discovered as a protein that associates 
with the HIV TAR RNA (5) and was later demonstrated 
to interact with and regulate interferon-induced protein 
kinase R (PKR), a kinase involved in the cellular 
response to viral infection (6,7). In the context of 
miRNA biogenesis, TRBP associates with Dicer, the 
RNase III enzyme that catalyses the removal of the 
apical loop of pre-miRNAs (8) and processes small 
interfering RNAs [siRNAs; (9)]. TRBP has been shown 
to enhance dicing and augment Dicer stability (3,4,10), 
although the molecular mechanisms for these activities 
have not been elucidated. Downstream of Dicer, TRBP 
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is also a component of the RISC-loading complex (RLC), 
the protein assembly responsible for transferring the 
miRNA/miRNA* duplex from Dicer to RISC (11,12). 
TRBP contains three double-stranded (ds) RNA-binding 
domains (dsRBDs), but only the first two dsRBDs interact 
with dsRNA. The third dsRBD is believed to mediate 
protein/protein interactions and bind to other proteins in 
the Dicer complex. The contribution of TBRP to the 
function of either the Dicer complex or RLC is not well- 
understood. The presence of two types of dsRBD supports 
the hypothesis that TRBP mediates both protein/RNA and 
protein/protein contacts, although the targets, location 
and consequences of these interactions in the context of 
the Dicer complex or RLC remain to be elucidated. 

Dicer homologues are found in Drosophila melanogaster 
and Caenorhabditis elegans, and they associate with 
proteins with a similar domain composition to TRBP. 
Proteins such as R2D2 (13) and Loquacious (14,15) in D. 
melanogaster and RDE-4 (16) in C. elegans all contain two 
N-terminal canonical RNA-binding dsRBDs as well as a 
non-canonical C-terminal dsRBD, which is implicated in 
protein-protein interactions with the respective Dicer 
homologue. In plants, Dicer-like 1 (DCL1) is found in 
complex with HYPONASTIC LEAVES 1 (HYL1), an ac- 
cessory protein, which contains only two dsRBDs (17). 
HYL1 interacts with pre-miRNA and miRNA/miRNA* 
duplexes primarily through its first dsRBD (18,19). 
Human Dicer has also been shown to associate with a 
TRBP homologue, Protein Activator of PKR [PACT; 
(20,21)]. A complex containing Dicer, TRBP and PACT 
can be purified from human cells (21). PACT also contains 
three dsRBDs and, like TRBP, is implicated in the regu- 
lation of PKR. However, unlike TRBP, PACT activates 
PKR (22). 

miRNAs regulate gene expression at the post- 
transcriptional level (1,2). To date, M500 human 
miRNAs have been documented in the database of 
miRNAs, miRbase (23). miRNAs are believed to 
regulate the expression of upwards of 30% of human 
genes by enabling RISC to recognize complementary se- 
quences in target messenger RNAs. Many miRNAs have 
oncogene- or tumor-suppressor-like behaviour (24). 
Mis-expression or mutation of miRNAs can have consid- 
erable cellular consequences (25). For example, miR-155 
has been implicated in lung and breast cancer as well as 
different leukaemias and lymphomas (24). Mutations in 
the proteins involved in the processing or regulation of 
miRNA biogenesis have also been linked to different 
cancers. For example, certain frame-shift mutations 
implicated in colorectal or endometrial cancers result in 
truncated forms of TRBP that lack dsRBDs 2 and 3 (26). 
Furthermore, TRBP is phosphorylated by the mito- 
gen-activated protein kinase Erk (27), and it has been sug- 
gested to interact with the anti-cancer drug enoxacin (28). 
These recent studies have provided further evidence of the 
importance of TBRP in miRNA biogenesis. 

Over the past decade, there has been a drive to under- 
stand the structural biology of miRNA biogenesis (29,30). 
Nuclear magnetic resonance (NMR) spectroscopic and 
X-ray crystallographic studies of TRBP have elucidated 
the isolated 3D structures of the first two dsRBDs (31) 



as well as the second dsRBD in complex with a short 
siRNA duplex (18). Negative-stain electron microscopy 
studies of miRNA biosynthesis complexes containing 
TRBP have also been reported; however, the level of reso- 
lution is not yet sufficient to accurately describe the 
location of TRBP in these complexes or determine how 
it interacts with other components (32,33). Many bio- 
chemical studies have sought to explore the mechanism 
for RNA recognition by TRBP or examine how TRBP 
associates with Dicer (3,4,10-12,34,21). However, despite 
these efforts, the exact role of TRBP in immature miRNA 
recognition and processing remains unclear. 

Here we present an in-depth biophysical analysis of the 
complex formed between the RNA-binding region of 
TRBP and biosynthetic precursors of the oncogenic 
human miRNA miR-155. We establish that the two 
dsRBDs that form the RNA-binding region function in- 
dependently in apo- and RNA-bound forms. We show 
that this region of TRBP interacts with both pre-miR- 
155 and miR-155/miR-155* with affinities in the low 
micromolar range and that it can form a large complex 
in which four dsRBDs interact with a single immature 
miRNA molecule. These data, which concern functional 
regions of the protein and their interactions with relevant 
biological targets, further enhance our understanding of 
the role of TRBP in miRNA biogenesis. 

MATERIALS AND METHODS 

Protein expression and purification 

Residues 19-99 (TRBP-D1; monomeric molecular weight: 
8.7 kDa), 157-228 (TRBP-D2; 8.2 kDa) or 19-228 
(TRBP-D12; 22.6 kDa) of TRBP were sub-cloned into 
pLX06 (Protein'eXpert, France) using a codon-optimized 
TRBP2 (AAA36765) cDNA template (GeneArt). Each 
construct was expressed with an N-terminal 6-histidine 
purification tag and a tobacco etch virus cleavage site. 
Protein constructs were expressed and purified as previ- 
ously described (35). 

Large-scale production of pre-miR-155 and 
miR-155/miR-155* 

Pre-miR-155 (5'-GGAAU GCUAA UCGUG AUAGG 
GGUUU UUGCC UCCAA CUGAC UCCUA CA 
UAU UAGCA UUCCC A-3') or miR-155/miR-155* 
(miR-155: 5'-GGAAU GCUAA UCGUG AUAGG 
CCU-3'; miR-155*: 5'-GGCCU ACAUA UUAGC 
AUUCC CA-3') were produced by in vitro transcription 
using in-house produced T7 RNA polymerase and fully 
complementary ds DNA oligonucleotides containing a T7 
promoter. As each translated RNA sequence was required 
to start with a GG dinucleotide, it was necessary to 
modify the native RNA sequences. Additional sequence 
changes were required to preserve the Watson-Crick 
base-pairing. Non-native sequence nucleotides are 
underlined in the sequences provided previously. 

Template oligonucleotides (Eurogentec) were annealed 
by incubation at 95°C for 5 minutes. The transcription 
reaction was performed in a standard T7 RNA polymer- 
ase buffer (40 mM Tris-HCl pH 8.1, ImM spermidine, 
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5mM dithiothreitol, 0.1 % Triton X-100, 40 ug/ml bovine 
serum albumin, 80mg/ml polyethylene 8000, 4mM of 
each nucleotide triphosphate, 15-20mM MgCl 2 , 35 |ig/ml 
DNA matrix, 0.1 mg/ml T7 RNA polymerase and 1 U/ml 
pyrophosphatase). The reaction mixture was incubated at 
37°C for 3 hours. RNA products were purified by poly- 
acrylamide gel electrophoresis, dialyzed multiple times 
against water and lyophilized. The folding of each RNA 
was achieved by heating each sample in water at 96°C for 
5 minutes, followed by rapid cooling induced by the 
addition of ice-cold 2x NMR buffer. All RNA samples 
were analysed by ID ( 1 H) NMR spectroscopy. 

NMR spectroscopy 

(^H^C^N) backbone resonance assignments of TRBP 
constructs in complex with either pre-miR-155 or 
miR-155/miR-155* were obtained using a combination of 
3D BEST triple resonance NMR experiments (36) and ti- 
tration experiments. (^H^C^N) backbone resonance as- 
signments of apo-TRBP-D12 were taken from our previous 
work [BioMagResBank code: 18324; (35)]. Assignments of 
TRBP-D12 in complex with pre-miR-155 were obtained by 
analysing 3D CH, 13 C, 15 N) HNCA and HNCO 
experiments, which were acquired using a 230 jiM 
[t/- 2 H, 13 C, 15 N]-labelled sample of TRBP-D12 and a 
3.5-fold excess of unlabelled pre-miR-155. The results of 
titration experiments (see later in the text) were also used 
to aid assignment of TRBP-D12 in complex with pre-miR- 
155 or in complex with miR-155/miR-155*. 

Apo- and RNA-bound states of the different TRBP 
constructs were analysed using 2D BEST ( ! H, 15 N) 
TROSY experiments at various protein concentrations 
(0.4 to 0.8 mM) and RNA/protein ratios. Titration experi- 
ments were performed with the following RNA/protein 
ratios: (i) TRBP-D12 with 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.75, 
1.0, 1.25, 1.5, 1.75, 2.0 and 2.3 molar equivalents of 
pre-miR-155 or miR-155/miR-155*; (ii) TRBP-D1 with 
0, 0.1, 0.22, 0.29, 0.33, 0.4, 0.5, 0.75, 1.0 and 1.5 molar 
equivalents of pre-miR-155; or (iii) TRBP-D2 with 0, 0.1, 
0.5, 0.75, 1.0 and 1.5 molar equivalents of pre-miR-155. 
The time required to collect individual titration points 
depended on the protein concentration and the 
signal-to-noise ratio. 

All NMR experiments were performed at 25° C using a 
Varian (Agilent) DirectDrive 800 MHz spectrometer 
equipped with a triple resonance cryogenic probe. NMR 
spectra were processed using NMRpipe (37) and analysed 
using CCPNMR Analysis, version 2.1.5 (38). Compound 
chemical shift differences, A5, between free protein and 
RNA-bound protein were calculated using the following 
equation: A5 = [AS^ + (A5 N /6.5) 2 ] 1/2 , where A5 X is the 
different in chemical shift between comparable resonances 
in different 2D spectra (39). 

Multi-angle laser light scattering 

Fifty-microlitre samples of TRBP-D1 (1.8 mg/ml), 
TRBP-D2 (1.7 mg/ml), TRBP-D12 (2 mg/ml) and 
pre-miR-155 (4 mg/ml) were loaded onto an analytical 
Superdex S200 size exclusion chromatography (SEC) 
column (GE Healthcare) pre-equilibrated with NMR 



buffer. SEC was performed at 0.5 ml/min with an in-line 
multi-angle laser light scattering (MALLS) spectrometer 
(DAWN EOS, Wyatt Instruments) with 18 angles 
measuring at 690 nm. An in-line refractive index detector 
(RI 2000, Spares) was used to follow the differential re- 
fractive index relative to the solvent (1.34). Masses were 
estimated with the Debye model using ASTRA software 
version 5.3.4.20 (Wyatt Instruments) using a theoretical 
dn/dc value of 0.185ml/g. 

Sedimentation velocity analytical ultracentrifugation 

Sedimentation velocity experiments were performed with a 
Beckmann Coulter XLI analytical ultracentrifuge at 
50 000rpm (185 000g), 25°C with an An-50Ti rotor. 
Samples or reference (NMR buffer) were loaded into 
sapphire-windowed cells with 12-mm optical paths, 
which had previously been treated with RNase ZAP 
(Sigma- Aldrich). Acquisition was performed at 260 nm 
and 280 nm, and using interference optics. The molar 
masses for the polymers were calculated from their com- 
position. A partial specific volume (v) of 0.74 ml/g was 
estimated for TRBP-D12 using the software 
SEDNTERP version 2 (40). The v used for pre-miR-155 
(0.508 ml/g) was taken from a study of a viral RNA in 
dilute aqueous KC1 solutions (41). The data were analysed 
in SEDFIT [Version 12.52; (42)] using the c(s) analysis and 
non-interacting species models. The latter was used to 
estimate buoyant molar masses, Mb, using the equation, 
M h = M x [1 — (p x v)], where M is the molar mass and 
p the solvent density. Hybrid models and multi- 
wavelength analysis were performed in the program 
SEDPHAT (43). Results in terms of buoyant mass and 
multi- wavelength analysis are given for the more 
concentrated sample, which yielded higher signal-to-noise 
ratios. The mass density (1.005 g/ml) and viscosity (0.898 
cP) of the buffer were calculated using SEDNTERP 
(v20 120328 beta, www.jphilo.mailway.com). 

The v for possible protein/RNA complexes was 
estimated by summing the individual v values of each com- 
ponent weighted by their fractional mass in the complex 
(Supplementary Table SI). The sedimentation coefficient, 
s, of each complex estimated from the distribution of sedi- 
mentation coefficients, c(s), was not affected by the choice 
of the three possible values. c(s) profiles reported here 
were fitted with a value of 0.64 ml/g. The Svedberg and 
Stokes equations were used to analyse the value of the 
experimental sedimentation coefficients. 

Isothermal titration calorimetry 

All isothermal titration calorimetry (ITC) experiments 
were conducted using a MicroCal ITC-200 at 25°C. ITC 
experiments were performed in two ways: either with the 
'ligand', in this case the protein, in the cell, or with 'ligand' 
in the syringe. Protein concentrations varied between 30 
and 60 (iM in the sample cell and 250 and 500 (iM in the 
syringe. RNA concentrations varied between 25 and 
40 jiM in the sample cell and 150 and 300 uM in the 
syringe. The heat evolved after each injection of protein 
or RNA was obtained from the integral of the 
baseline-corrected calorimetric signal. The equations 
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Figure 1. Analysis of the solution behaviour of dsRBDs from TRBP. (a) Diagram of the three constructs of TRBP used in this study. The same 
colour scheme for TRBP-D1 (red) and TRBP-D2 (green) is used throughout; (b) analytical SEC profiles showing elution volume vs. absorbance at 
280 nm, A 2 8o> for TRBP-D1 (red), TRBP-D2 (green) and TRBP-D12 (black). The molecular weight of each species determined by MALLS is given; 
(c) site-by-site comparison of the (^H^N) chemical shift of a residue in the single domain and the corresponding residue in TRBP-D12. Circles 
denote ( 15 N) chemical shifts and squares ( ! H) chemical shifts. Amide groups from TRBP-D1 are coloured in red and those from TRBP-D2 in green. 
The points corresponding to the large outliers are annotated in grey. RMSD values for H and 15 N chemical shifts are provided. 



used to analyse ITC data of TRBP-D12 were derived from 
the classical sequential binding equation using activity 
coefficients of 1. Previous assumptions of independency 
or co-operativity were not made. The resulting binding 
isotherms were fitted to maximize the likelihood of the 
data assuming a Gaussian noise distribution. The titration 
data were globally fitted for each pair of experiments using 
previously described protocols (44). The standard devi- 
ation of the estimation of each binding parameter was 
determined by computing 100 Monte Carlo simulations 
of each experiment performed assuming a Gaussian 
noise of 70 nanocals (estimated from our data) followed 
by a global fitting of the data. The stoichiometry for each 
protein/RNA pair was estimated by comparing the fits of 
sequential models for 1:1 and 1:2 protein/RNA complexes 
(see Supplementary Material). 



RESULTS 

Two independent dsRBDs comprise the RNA-binding 
region of TRBP 

Three constructs derived from the RNA-binding region 
of TRBP were produced (Figure la) and characterized 
by SEC. Analysis of the eluted protein by SEC-MALLS 
reported values of 8.5 ± 0.5 kDa, 8.4 ± 0.5 kDa and 
23.2±0.9kDa for TRBP-D1, TRBP-D2 and TRBP- 
D12, respectively (Figure lb). These values correspond 
well with the respective monomeric masses of each 
protein construct. 

Two dimensional ^H^N) HSQC spectra of each of the 
three constructs displayed good l H resonance dispersion, 
which indicated that the dsRBDs adopt folded 3D struc- 
tures. The spectrum of the double-domain construct, 
TRBP-D12, can be reconstituted by overlaying the 
spectra of TRBP-D1 and TRBP-D2 (Supplementary 
Figure Sla-d). The additional peaks visible in the 
spectrum of TRBP-D12 correspond to residues 96-151, 
which are located in the linker region. The ^H^N) res- 
onance frequencies of a given residue in a single-domain 



construct compare well with their counterparts in the 
double-domain construct, with root mean squared 
deviation (RMSD) values of 0.02 and 0.09 ppm for l H 
and 15 N frequencies, respectively (Figure lc). The largest 
deviations in compound chemical shift differences between 
comparable sites are predictably found at the C-terminus 
of TRBP-D1 (Supplementary Figure Sle). These differ- 
ences can be easily attributed to the presence of the 
inter-domain linker in the double-domain construct. The 
similarity in corresponding NMR signals strongly suggests 
that the two domains do not interact in the tandem 
dsRBD construct. This observation is consistent with pre- 
viously reported ( ! H, 15 N) heteronuclear NOE data, which 
demonstrated the flexibility of the inter-domain linker 
(35). Taken together, these data suggest that the 
RNA-binding region of TRBP is composed of two inde- 
pendent dsRBDs connected by a flexible linker and, there- 
fore, that it is similar to the tandem dsRBDs that form the 
dsRNA-interacting region of PKR (45). 

Both dsRBDs of the RNA-binding region of TRBP 
interact with pre-miR-155 

Large, site-specific changes in ('H, 15 N) chemical shifts are 
observed when unlabelled pre-miR-155 is added to con- 
structs of TRBP (Figure 2a). Backbone resonance assign- 
ments were obtained of TRBP-D12 in the presence of an 
excess of pre-miR-155. Using these sequence-specific as- 
signments, it was possible to evaluate which regions of 
each dsRBD were involved in the formation of protein/ 
RNA complexes. A comparison of the amide chemical 
shifts of apo- and RNA-bound TRBP-D12 revealed 
large RMSD values between the two data sets (0.11 ppm 
for ! H and 0.54 ppm for 15 N frequencies; Supplementary 
Figure S2). The largest chemical shift changes observed on 
complex formation are restricted to amino acids located in 
the two dsRBDs. Only negligible chemical shift changes 
occur in the linker region (Figure 2a-c). The profiles of 
residue-by-residue chemical shift changes of the two 
dsRBDs in TRBP-D12 are highly comparable 
(Figure 2a-c). 
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Figure 2. NMR analysis of the interaction between TRBP-D12 and pre-miR-155. (a) Compound chemical shift values (A5) measured when 
TRBP-D12 interacts with pre-miR-155 plotted as a function of residue number (black). A5 = [A5^ + (A5 N /6.5) 2 ]' /2 , where A5 X is the difference in 
chemical shift between apo- and RNA-bound spectra (39). Prolines and unassigned residues have been given values of —0.05 and — O.lppm, 
respectively. For comparison, the change in accessible surface area (AASA) when TRBP-D2 forms a complex with dsRNA is shown (green). 
A ASA were calculated using the POPS* server (46) from 3ADL (18). The boundaries of secondary structure elements were taken from the 3D 
structures of TRBP-D1 [3LLH; (31)] and TRBP-D2 [3ADL; (18)]; (b,c) TOP: cartoon representation of the 3D structures of TRBP-D1 and 
TRBP-D2 showing the three regions in canonical dsRBDs that are implicated in dsRNA binding; (b,c) BOTTOM: A5 values plotted on the 
respective 3D structures. Each amide nitrogen is represented by a sphere and coloured according to the A 5 scale provided. The P2/3 loop was 
not resolved in the 3D structure of TRBP-D1 and is shown by a broken line. 



Three critical regions in canonical dsRBDs mediate the 
interaction with dsRNA (47): region 1, oc-helix 1 (oci); 
region 2, the loop between strands (3 2 and p 3 (P2/3); and 
region 3, the N- terminus of oc 2 (Figure 2b and c). The 
NMR chemical shift perturbation profiles showed that 
both TRBP-D1 and TRBP-D2 interact with pre-miR- 
155 through the classical dsRBD interaction site. 
Residues located in regions 1, 2 and 3 exhibit the largest 
changes in chemical shift or disappear entirely from the 
spectrum, even after the addition of 2.3-fold excess of 
pre-miR-155 (Figure 2). These chemical shift changes are 
consistent with previously published structural and bio- 
chemical analyses of other dsRBD/dsRNA complexes 
(31, 47-50). The 3D structure of TRBP-D2 in complex 
with a short siRNA was reported in 2010 (18). The 
residue-by-residue changes in solvent accessible surface 
area, AASA, that occur on the formation of this 
complex highlight the location of the three critical RNA 
interaction regions in TRBP-D2 (Figure 2a). The profile 
of AASA for the TRBP-D2/siRNA complex and A5 
observed for dsRBD-2 in the TRBP-D12/pre-miR-155 
complex correspond well. These observations suggest 
that the interaction between TRBP-D2 and the imperfect 
stem-loop of pre-miR-155 in solution is similar to that 
reported for a perfect A-form dsRNA duplex and, further- 
more, that highly similar protein/RNA contacts are 
formed in the context of the tandem dsRBD construct. 

Titration of [ 15 N]-labelled TRBP-D1 or TRBP-D2 with 
unlabelled pre-miR-155 revealed that both of the isolated 
dsRBDs of TRBP can interact with pre-miR-155. 
Two-dimensional ^H^N) HSQC spectra of either 
TRBP-D1 or TRBP-D2 showed clear and significant 
chemical shift perturbations after the addition of RNA. 
NMR data from sub-equimolar titration points were 
characterized by severe line-broadening. Many cross- 
peaks broaden beyond detection after the addition of 0.1 
equivalents of RNA and reappear when the RNA/protein 



ratio exceeds 0.3-0.5 (Supplementary Figure S3). 
Two-dimensional ^H^N) HSQC spectra of TRBP-D12 
in complex with pre-miR-155 superpose nicely with 
spectra of either of the single dsRBDs in complex with 
the same RNA (Figure 3a-d). As was the case for the 
apo-protein (Supplementary Figure SI), the spectrum of 
the double-domain construct in complex with pre-miR- 
155 is essentially a superposition of the spectra of the in- 
dividual dsRBDs in complex with the same RNA 
(Figure 3a-d). Site-by-site differences in ^H^N) reson- 
ance frequencies between single-domain and double- 
domain data sets are very small, with RMSD values of 
0.02 and 0.1 1 ppm for ! H and 15 N frequencies, respectively 
(Figure 3e). That is, a cross-peak for an amide group in 
the TRBP-D1 /pre-miR-155 complex, for example, has 
almost exactly the same ^H^N) resonance frequencies, 
and therefore experiences a near-identical local chemical 
environment, as the comparable amide group in the 
tandem domain/pre-miR-155 complex. This similarity of 
resonance frequencies between single- and double-domain 
constructs suggests that each dsRBD forms essentially the 
same interaction regardless of whether it is a component 
of the tandem domain construct. If the interaction with 
pre-miR-155 led to (or resulted from) the formation of 
contacts between the two dsRBDs, this would result in 
chemical shift differences between NMR spectra of the 
single- and double-domain constructs. However, the 
small RMSD between these data sets suggests that each 
dsRBD in TRBP-D12 interacts independently with 
pre-miR-155. 

The RNA-binding region of TRBP can form a 2:1 
complex with pre-miR-155 

ITC experiments were conducted to further characterize 
the complex formed between the dsRBDs of the RNA- 
binding region of TRBP and pre-miR-155. The resulting 
binding isotherms were globally fitted and best described 
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Figure 3. Comparison of NMR data collected of single- and double-domain constructs in complex with pre-miR-155. Two dimensional (^H^N) 
spectra of (a) TRBP-Dl/pre-miR-155 in red, (b) TRBP-D2/pre-miR-155 in green, and (c) TRBP-D12/pre-miR-155 in black. A coloured graphical 
representation of each construct is provided above the spectrum, (d) Superposition of the spectra shown in (a-c) retaining the same colour scheme 
and construct graphics, (e) Comparison of the ( ! H, 15 N) resonance frequencies of a residue in the single domain and the corresponding residue in 
TRBP-D12. Circles denote ( 15 N) chemical shifts and squares ( ! H) chemical shifts. Amide groups from TRBP-D1 are coloured in red and those from 
TRBP-D2 in green. The points corresponding to the large outliers are annotated. RMSD values for ! H and 15 N chemical shifts are provided. 



by a two-site sequential binding model in which the first 
molecule of TRBP-D12 binds pre-miR-155 approximately 
10 times more strongly than the second molecule 
(K d?1 = 0.5 ± 0.03 uM; K d?2 = 6.2 ± 0.2 nM; Figure 4a 
and Supplementary Table S2). The difference in affinity 
measured for the two molecules of TRBP-D12 is most 
likely due to a decrease in the number of sites available 
for the second binding event with respect to the first. The 
interaction between TRBP-D12 and pre-miR-155 exhibits 
an exothermic character, which suggests that polar inter- 
actions are the primary driving force of the binding event 
and which is consistent with the electrostatic nature of 
interactions between dsRNAs and dsRBDs (47). 

A comparison between the binding modes of single- and 
double-domain constructs was also performed. ITC data 
obtained from the titration of TRBP-D1, TRBP-D2 or 
TRBP-D12 with pre-miR-155 were analysed using the 
standard single-site model. The resulting macroscopic dis- 
sociation constant, within the context of a identical 
independent sites model for the complex formed between 
TRBP-D12 and pre-miR-155, is 3.4±0.8uM 
(Supplementary Figure S4b and c and Supplementary 
Table S2). ITC analyses of the interactions between 
single-domain constructs and pre-miR-155 revealed that 
both single-domain constructs, TRBP-D1 and TRBP-D2, 
interact with pre-miR-155 to form 4:1 complexes 
(Supplementary Figure S4b and c and Supplementary 
Table S2). All macroscopic binding constants measured 
were in the mid-to-low micromolar range, consistent with 
results of TRBP-D12. TRBP-D2 (K d = 3.7 ± 0.5 uM) 
interacts 4 times more strongly than TRBP-D1 (15.4 ± 
0.8 uM). Like TRBP-D12, both TRBP-D1 and TRBP-D2 
exhibit an exothermic binding enthalpy, again suggesting 
that polar interactions mediate binding. Furthermore, the 
macroscopic binding enthalpy obtained for TRBP-D12 is 
approximately equal to the sum of that of TRBP-D1 and 
TRBP-D2, which indicates that the polar interactions 
formed by each single domain and pre-miR-155 are 



unaffected in the context of the full-length protein. Thus, 
these ITC data support the conclusions derived from NMR 
analysis that each dsRBD interacts independently with 
pre-miR-155. 

The formation of a complex between TRBP-D12 and 
pre-miR-155 in solution was evaluated by sedimentation 
velocity analytical ultracentrifugation (svAUC). A series 
of samples were prepared to characterize the behaviour of 
the free protein, free RNA and the protein/RNA complex 
under different conditions. SEC-MALLS (Figure lb) 
strongly suggests that TRBP-D12 is monomeric in 
solution. This observation was supported by svAUC data 
at 10 and 20 uM, which revealed an experimental sedimen- 
tation coefficient, s exp , of 2. 19 S and which was independent 
of the protein concentration used (Figure 4b). A frictional 
ratio of 1.43 for TRBP-D12 was calculated by combining 
s Qxp and the molecular mass (22.3 kDa) of the monomeric 
protein. This value is in excellent agreement with one pre- 
viously reported for a protein composed of two dsRBDs 
connected by a flexible linker (51). When a single sediment- 
ing species is considered, a molar mass of 22.4 kDa is 
calculated from the sedimentation profile of TRBP-D12, 
without assumptions about the shape of the sedimenting 
species, which is in excellent agreement with the expected 
molecular mass. Similar svAUC experiments indicated that 
pre-miR-155 is largely monomeric in solution at sample 
concentrations of 1 and 2 uM. A s Qxp of 4.4 S is obtained 
for pre-miR-155 at both 1 jiM and 2uM, which corres- 
ponds to a frictional ratio of 1.35. Non-interacting 
species analysis gives a molecular mass of 18.5 kDa, 
which is again close to the theoretical value of 19.4 kDa. 
An additional minor contribution at 7.2 S (6% of the main 
peak) was also detected. This peak is likely to result from a 
small fraction of dimerized pre-miR-155. This species was 
also observed by SEC-MALLS (data not shown). 

SvAUC analysis of the TRBP-D12/pre-miR-155 
complex was performed using three samples, each with a 
different excess of protein. In the resulting sedimentation 
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Figure 4. Characterization of the interaction between TRBP-D12 and pre-miR-155. (a) An example of ITC data recorded of the titration of 
pre-miR-155 with TRBP-D12. The top panel shows power vs. time together with a schematic of the components of the sample. The bottom 
panel shows the integrated injection enthalpy per mole of injectant plotted as a function of the ratio of protein and RNA concentrations. The 
data were fitted to a two-site sequential binding model shown by a solid black line. The two dissociation constants are given. The thermodynamic 
parameters calculated are provided in Supplementary Table S2. (b) Comparison of c(s) distributions of TRBP-D12 (grey) and pre-miR-155 (purple); 
(c) c(s) distributions of a sample prepared with 10 uM TRBP-D12 and different concentrations of pre-miR-155. The protein and/or RNA concen- 
tration corresponding to each curve is provided. 



profiles, the RNA contribution at 4.4 S has disappeared 
and a new faster contribution is visible, which sediments at 
5.9 S (Figure 4c). The composition of the 5.9 S peak was 
estimated using the data obtained from the three different 
optical signals. The ratios of these signals as well as 
theoretical values calculated for TRBP-D12/pre-miR-155 
complexes with different stoichiometries are given in 
Supplementary Table SI. The experimental ratios 
measured in the c(s) analysis for the fast boundary 
(Supplementary Table SI) reflect an enrichment of 
protein in the complex and strongly suggest that a 
protein/RNA complex with a stoichiometry of 2:1 can 
be formed in solution. 

The combination of ITC and AUC analyses demon- 
strates that two molecules of TRBP-D12 can interact 
with a single molecule of pre-miR-155. These data are 
consistent with ITC analysis of the two single-domain 
constructs, which demonstrated that four molecules of 
either TRBP-D1 or TRBP-D2 can interact with a single 
pre-miR-155. It is therefore likely that pre-miR-155 has a 
minimum of four dsRBD-binding sites. 

The RNA-binding region of TRBP interacts with the 
substrates and products of Dicer 

Dicer excises the apical loop of pre-miR-155 (nucleotides 
24-39) to generate a largely double-stranded molecule 
containing the 23-nucleotide miR-155 and its complement, 
miR-155* (Supplementary Figure S3a). ITC experiments 
were conducted to characterize the interaction between 
miR-155/miR-155* and the RNA-binding region of 
TRBP (Figure 5a). These data revealed many similarities 
to those measured for the interaction between TRBP-D12 



and pre-miR-155. The interaction between TRBP-D12 
and miR-155/miR-155* was also exothermic in character 
(Figure 4a). In addition, the binding isotherms were also 
optimally described by a two-step sequential binding 
model (K d5l = 1.0 ± 0.7 uM; K d?2 = 5.7 ± 0.3 uM; 
Supplementary Table S2). Finally, as was seen for the 
TRBP-D12/pre-miR-155 complex, the first molecule of 
TRBP-D12 binds miR-155/miR-155* more strongly than 
the second molecule. The difference in dissociation con- 
stants is again likely due to the reduced probability of the 
second molecule of TBRP-D12 binding. 

NMR spectroscopy was used to assess which regions of 
TRBP-D12 interact with the miR-155/miR-155* duplex. 
Large chemical shift changes are observed in 2D ^H^N) 
HSQC spectra of [ 15 N]-labelled TRBP-D12 on the 
addition of unlabelled miR-155/miR-155* (Figure 5b 
and Supplementary Figure S5b and c). Comparison of 
the spectra of the two TRBP-D12/RNA complexes 
shows little difference between the magnitude or 
sequence location of chemical shift perturbations. The 
changes in chemical shift for TRBP-D12 on binding 
miR-155/miR-155* are largest in the three RNA inter- 
action regions of each dsRBD (Supplementary Figure 
S3b). Again, residues in the linker are barely perturbed. 
NMR spectra of TRBP-D12 in complex with either 
pre-miR-155 or miR-155/miR-155* are nearly identical 
(Figure 5b and Supplementary Figure S5b and c). 
Site-by-site differences in ^H^N) resonance frequencies 
between single-domain and double-domain data sets are 
very small, with RMSD values of 0.02 and O.llppm for 
! H and 15 N frequencies, respectively (Supplementary 
Figure S5c). These data suggest that the complexes 
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Figure 5. Interaction of the RNA-binding region of TRBP with the substrate and product of Dicer, (a) An example of an ITC analysis of the 
titration of miR-155/miR-155* with TRBP-D12 showing power vs. time data (top, with a schematic of the components of the sample) and the 
integrated injection enthalpy per mole of injectant plotted as a function of the protein/RNA ratio (bottom). The data were fitted to a two-step 
sequential binding model shown by a solid black line. The two dissociation constants and their standard deviations are given. The complete 
thermodynamic parameters derived are provided in Supplementary Table S2. (b) Comparison of the changes in ^H^N) NMR spectra of 
TRBP-D12 resulting from the addition of either pre-miR-155 (left) or miR-155/miR-155* (right). The nucleotide sequence and predicted secondary 
structure of each RNA are provided. ^H^N) assignments of cross-peaks in each spectrum are annotated and coloured according to the dsRBD in 
which they reside (red, dsRBD-1; green, dsRBD-2). 



formed between TRBP-D12 and either pre-miR-155 or 
miR-155/miR-155* are highly similar. 

ITC and NMR data demonstrate that TRBP-D12 inter- 
acts with two different precursors of miR-155. These data 
support the formation of two highly similar complexes, 
which are dependent on the same protein/RNA inter- 
action surfaces. These data also suggest that the apical 
loop of pre-miR-155 is neither directly recognized by 
TRBP nor is it required for the interaction. The similarity 
in the measured affinities also suggests that the RNA- 
binding region of TRBP alone cannot differentiate 
between the two precursor molecules. 



DISCUSSION 

dsRBDs often appear in modular RNA-binding proteins, 
either in multiple copies or in combination with other 
functional domains such as RNase III, RNA helicases 
and PAZ domains (47). Structural studies of modular 
proteins containing single or multiple copies of dsRBDs 
have frequently revealed that these domains do not 
interact with other structured regions in the same 
protein. An NMR spectroscopic study of the dsRNA- 
binding region of PKR revealed no interaction between 
the two dsRBDs (52). Likewise, the two dsRBDs of 
ADAR2 (50) and HYL1 (19) were also shown to behave 
independently. In PKR, ADAR2 and HYL1, a long, un- 
structured linker connects the two dsRBDs. We have 
shown here that the first two dsRBDs of TRBP are also 
independent. Solution NMR analysis of different con- 
structs of the RNA-binding region of TRBP revealed no 
contact between the two dsRBDs (Figure lc and 



Supplementary Figure SI), while SEC-MALLS indicated 
that each isolated dsRBD is monomeric in solution. 

NMR (Figure 3) and ITC analysis (Supplementary 
Figure S4) of TRBP-D1 and TRBP-D2 revealed that 
each isolated domain interacts with pre-miR-155, which 
is consistent with previous conclusions that these 
two dsRBDs constitute the main interaction site for 
immature miRNAs in TRBP (18,31). However, little bio- 
physical or structural analysis of how multiple-domain 
constructs of TRBP interact with immature miRNAs 
has been previously reported. The NMR analysis of the 
interaction between TRBP-D12 and pre-miR-155 pre- 
sented here reveals several interesting features: first, com- 
parison of NMR spectra of single- and double-domain 
constructs in complex with pre-miR-155 shows no 
evidence of an interaction between the two dsRBDs 
(Figure 3). NMR spectra of each complex studied 
revealed no examples of peak doubling that would typic- 
ally indicate the formation of multiple different protein/ 
RNA or protein/protein interfaces. Furthermore, the very 
small perturbations of the ^H^N) chemical shifts of the 
inter-domain linker indicate that this region does not 
interact with pre-miR-155 nor does RNA binding induce 
contacts with either dsRBD. The elevated signal intensity 
of residues in the linker with respect to the two dsRBDs is 
consistent with this region remaining flexible in the 
TRBP-D12/pre-miR-155 complex. Taken together, these 
data strongly suggest that each dsRBD interacts with 
pre-miR-155 independently. This mode of interaction is 
similar to that used by ADAR2. A recently elucidated 
3D structure of the complex formed by ADAR2 and a 
71-nt stem-loop RNA revealed that the two dsRBDs 
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bind to distinct regions of the ds stem without any 
inter-domain contact or contribution from the 
inter-domain linker (50,53). 

dsRBDs in multiple arrays do not always interact with 
dsRNA independently. An interesting example that 
contrasts TRBP is DGCR8, a component of the 
Microprocessor complex, which performs the first process- 
ing step in miRNA biogenesis. The two dsRBDs of 
DGCR8 interact with each other through a large interface, 
which fixes their relative orientation and has important 
implications for pri-miRNA recognition. DGCR8 inter- 
acts with pri-miRNAs in preference to pre-miRNAs 
(54,55), whereas our data show that TRBP can interact 
with both pre-miRNAs and miRNA/miRNA* duplexes 
with similar affinities. These different patterns of RNA 
selectivity displayed by DGCR8 and TRBP could result 
from the contrasting ways in which the tandem dsRBDs 
interact with immature miRNAs. The compact structure 
formed by the DGCR8 core means that the two dsRBDs 
are not independent and that a pri-miRNA must bend to 
interact with both RNA-binding surfaces (56). A sufficient 
conformational change may not be feasible for the shorter 
pre-miRNA, leading to the preference for pri-miRNAs. 
Conversely, given that TRBP has been implicated in 
both the Dicer cleavage reaction and the transfer of the 
miRNA/miRNA* duplex to RISC, the independence of 
its two dsRBDs demonstrated here may reflect a broader 
range of binding targets. Taken together, these results 
suggest that there are considerable differences in how 
RNA is recognized by the different accessory proteins 
involved in the two miRNA biogenesis steps. 

A second important feature of the TRBP-D 12/pre-miR- 
155 complex is the stoichiometry. Both ITC (Figure 4a 
and Supplementary Figure S4) and svAUC (Figure 4b) 
experiments indicate that a complex containing four 
dsRBDs can be formed; that is, a single pre-miR-155 inter- 
acts with four single dsRBD constructs or two double- 
domain constructs. ITC data of the interactions between 
TRBP-D12 and pre-miR-155, and between TRBP-D12 
and miR155/miR155*, revealed no evidence of positive 
co-operativity. This result is consistent with earlier 
studies (34). 

The data presented here indicate that one pre-miR-155 
can associate with four dsRBDs to form a complex in 
which the dsRBDs interact with similar sites on the 
RNA but do not form inter-domain contacts. Is there suf- 
ficient space on pre-miR-155 to permit this? The theoret- 
ical secondary structure of pre-miR-155 is typical of 
pre-miRNAs and comprises an imperfect ds stem contain- 
ing two GU base-pairs, two unpaired nucleotides and a 
four-base apical loop (Figure 5b). The majority of 
pre-miRNAs in the miRbase (23) are predicted to adopt 
3D structures similar to A-form dsRNA (57). Three- 
dimensional structures of large stem-loop RNAs have 
revealed that features such as non- Watson-Crick base- 
pairs or nucleotide bulges often cause minimal perturb- 
ation to the overall ds character (53,58). Canonical 
dsRBDs typically interact with a 12-16-bp region of 
dsRNA (47). However, the RNA-binding region of 
PKR, which also contains two dsRBDs, can interact 
with 20-40-bp RNA duplexes with stoichiometries that 



are incompatible with a simple ID lattice model and 
which would require binding to multiple surfaces of the 
dsRNA with a binding site overlap of 4 bp (51). Several 
3D structures of protein/dsRNA complexes have also 
demonstrated that multiple dsRBDs can bind in proximity 
without forming inter-dsRBD interactions (53,59). 

A structural model of the interaction between pre-miR- 
155 and dsRBDs was generated to evaluate the feasibility 
of a protein/RNA complex containing the four dsRBDs 
per pre-miR-155 suggested by ITC and AUC analysis 
(Supplementary Figure S6). The model demonstrates 
that it is theoretically possible to arrange four non- 
contacting dsRBDs on a single molecule of pre-miR-155. 
The dsRBDs tessellate optimally when they interact with 
the RNA in alternating directions. The inter-dsRBD 
linker in TRBP is more than 60 residues and therefore 
places little limitation on the respective orientation of 
the two dsRBDs in the TRBP-D12 construct. In 
addition, NMR data suggest that the linker remains 
flexible in both of the TRBP-D 1 2/RNA complexes 
analysed here. The arrangement shown in Supplementary 
Figure S6 is also compatible with NMR and ITC data of 
the interaction between TRBP-D 12 and the Dicer product 
miR-155/miR-155*, as none of the four proposed dsRBD- 
binding sites in the model involve nucleotides from the 
excised apical loop. 

The functional stoichiometry of the Dicer/TRBP 
complex has not been determined (32). However, the 
report that a tertiary complex containing Dicer, TRBP 
and PACT can be formed implies that two TRBP-like 
proteins can interact simultaneously with Dicer in the 
presence of pre-miRNAs (21). In addition, the plant 
DCL1 accessory protein HYL1 has been shown to form 
2:1 protein/RNA complexes with dsRNA sequences of 
suitable length (18). Thus, high occupancy of dsRBDs on 
a relatively short sequence of dsRNA, as seen here with 
TRBP and two different precursors of miR-155, is poten- 
tially functionally relevant. Whether the 2:1 ratio observed 
here is a standard feature of the complexes formed between 
TRBP and immature miRNAs or whether the stoichiom- 
etry is determined by features in the target immature 
miRNA, such as the length or percentage of canonical 
dsRNA, remains to be determined. 

Much of the information concerning the role of TRBP 
in small regulatory RNA synthesis has been obtained from 
studies into siRNAs. The data presented here represent 
the first in-depth biophysical analysis of the interaction 
between TRBP and immature miRNAs that have an im- 
perfect dsRNA stem. Analysis of ITC data yielded macro- 
scopic dissociation constants for the TRBP/pre-miR-155 
complex in the low uM range, which is also consistent with 
other investigations of dsRBD/dsRNA interactions 
(18,19,56,60,61). The affinities measured for the isolated 
dsRBDs of TRBP indicate that TRBP-D2 interacts with 
pre-miR-155 approximately 4 times more strongly than 
TRBP-D 1. This observation is consistent with earlier 
interaction studies of TRBP and siRNAs (31). 
Differences between the binding affinities of dsRBDs in 
tandem or multi-dsRBD proteins are relatively common — 
e.g. HYL1 (18,19) or PKR (45)— and may play a role in 
target selection by fine-tuning the overall affinity of a 
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protein for dsRNA molecules that possess many potential 
dsRBD-binding sites. 

In addition to its function in the Dicer complex, TRBP 
is a component of the RLC (4,11,12). The precise role of 
TRBP in the formation and/or activity of RISC is less 
clear. We have shown here that the RNA-binding region 
of TRBP interacts with both pre-miR-155 and miR-155/ 
miR-155* — i.e. both the substrate and product of Dicer — 
with a comparable affinity (Figure 5a). Furthermore, 
NMR spectra of TRBP-D12 in complex with either 
pre-miR-155 or miR-155/miR-155* are highly similar 
(Figure 5b), which indicates that TRBP interacts with 
pre-miR-155 and miR-155/miR-155* through essentially 
identical sites. These results suggest that TRBP could par- 
ticipate in the recognition of Dicer substrates as well as the 
transfer of the cleavage product to RISC through the 
RLC. 



CONCLUSIONS 

Despite several structural studies, many of the mechanistic 
details that underpin human miRNA biogenesis remain 
unclear. One area in which the link between 3D structure 
and function remains particularly obscure concerns the 
role of the non-catalytic proteins of that associate with 
Drosha or Dicer. We have demonstrated here that the 
region of TRBP responsible for interacting with 
immature miRNAs is composed of two dsRBDs that rec- 
ognize RNA independently. The linker connecting the two 
dsRBDs in the RNA-binding region of TRBP remains 
flexible and does not interact with the bound RNA. 
Therefore, the formation of a complex between TRBP 
and immature miRNAs is unlikely to require bending of 
the ds stem of the RNA, as has been reported for DGCR8. 
Finally, the observation that RNA-binding region of 
TRBP can interact with both the substrates and 
products of Dicer suggests that TRBP may function at 
several stages in the miRNA biogenesis pathway. 
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